Language Model and Speaking Rate Adaptation for Spontaneous Presentation Speech Recognition


Similar resources

Unsupervised class-based language model adaptation for spontaneous speech recognition

This paper proposes an unsupervised, batch-type, class-based language model adaptation method for spontaneous speech recognition. The word classes are automatically determined by maximizing the average mutual information between the classes using a training set. A class-based language model is built based on recognition hypotheses obtained using a general word-based language model, and linearly...

Speaking rate dependent acoustic modeling for spontaneous lecture speech recognition

The paper addresses large vocabulary spontaneous speech recognition, focusing on acoustic modeling that considers the speaking rate. Using the real lecture speech corpus collected under the priority research project in Japan, we built a baseline acoustic model, evaluated it on the automatic transcription of oral presentations by experienced speakers, and obtained a word accuracy of 58.2%. Compar...

Frame-period adaptation for speaking rate robust speech recognition

This paper describes a frame-period adaptation method for speaking rate robust speech recognition. The proposed method determines an appropriate frame-period for each phrase by measuring its speaking rate or computing the acoustic likelihood with a set of frame-periods. Experimental results on spontaneous speech recognition show that the proposed method is effective for slower utterances. Actual...

Unsupervised language model adaptation methods for spontaneous speech

In this paper we examine the performance of three different unsupervised language model adaptation schemes applied to speech recognition of spontaneous speech lecture presentations. Two of the schemes have been described previously in the literature while the third is a variation of one of the other two schemes. All three schemes are based on a combination of word n-gram and class n-gram models a...

Dynamic language model adaptation using presentation slides for lecture speech recognition

We propose a dynamic language model adaptation method that uses the temporal information from lecture slides for lecture speech recognition. The proposed method consists of two steps. First, the language model is adapted with the text information extracted from all the slides of a given lecture. Next, the text information of a given slide is extracted based on temporal information and used for ...

Journal

Journal title: IEEE Transactions on Speech and Audio Processing

Year: 2004

ISSN: 1063-6676

DOI: 10.1109/tsa.2004.828641